Subword unit based speech recognition in car environments

نویسندگان

Alexander Fischer

Volker Stahl

چکیده

This paper presents results of speaker-independent speech recognition experiments concerning acoustic front-ends, models and their structures in car environments. The database comprises 350 speakers in 6 different cars. We investigate whole-word models, contextindependent phoneme models and context-dependent within-word phoneme models. We studied task-dependent (same vocabulary context in training and test) phoneme models and present first results on task-independent (broad context in training, i.e. phonetically rich material) scenarios. The latter allows flexible vocabulary definition for applications with dynamically changing command words or new applications avoiding an expensive data collection. Acoustic preprocessing is carried out with mel-cepstrum combined with spectral subtraction and SNR normalization. The task-dependentword error rates are well below 3% for both wholeword and phoneme models. The task-independent scenarios have to be worked on further.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Speech Recognition Using Demi-Syllable Neural Prediction Model

The Neural Prediction Model is the speech recognition model based on pattern prediction by multilayer perceptrons. Its effectiveness was confirmed by the speaker-independent digit recognition experiments. This paper presents an improvement in the model and its application to large vocabulary speech recognition, based on subword units. The improvement involves an introduction of "backward predic...

متن کامل

Word recognition using hidden Markov models and neural associative memories

for his interest in this thesis and his valuable advice. I thank my mentor Dr. Friedhelm Schwenker for his reading and his helpful recommendations. My thanks go also to Dr. Muhamed Qubbati and David Bouchain for a critical reading and for their useful suggestions. I also have to thank the Graduate School, University of Ulm whose doctoral scholarship financed this thesis. Further thanks go to my...

متن کامل

Data driven subword unit modeling for speech recognition and its application to interactive reading tutors

This paper proposes a novel token-passing search architecture for supporting subword unit based speech recognition and a corresponding algorithm based on the well-known LZW text compression method to determine a vocabulary of subword units in an unsupervised manner. We compare our subword unit selection algorithm to an existing approach based on Minimum Description Length (MDL) modeling and als...

متن کامل

Are Initial / Final Units Acoustically Accurate ?

| We show a comparative study of subword unit segmentation of Mandarin speech data. Most HMM recognition systems use intial//nals as subword units for Mandarin speech. We nd that such a division of monosylla-ble data into intial//nal units are not always supported by acoustic evidences. We implement a delta MFCC based seg-mentation method and compare its output with that of Viterbi segmentation...

متن کامل

Combined Optimisation of Baseforms and Model Parameters in Speech Recognition Based on Acoustic Subword Units

A major challenge in speech recognition is creating a lexicon which is robust to inter-and intra-speaker variations. This is even more so in speech recognisers based on non-linguistic units, e.g., acoustic subword units (ASWUs), since no standard pronunciation dictionaries are available. Thus the baseforms describing the vocabulary words in terms of the recognition units need to be generated fr...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 1998

Subword unit based speech recognition in car environments

نویسندگان

چکیده

منابع مشابه

Speech Recognition Using Demi-Syllable Neural Prediction Model

Word recognition using hidden Markov models and neural associative memories

Data driven subword unit modeling for speech recognition and its application to interactive reading tutors

Are Initial / Final Units Acoustically Accurate ?

Combined Optimisation of Baseforms and Model Parameters in Speech Recognition Based on Acoustic Subword Units

عنوان ژورنال:

اشتراک گذاری